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1 MERSENNE NUMBERS AND PRIMITIVE PRIME DIVISORS. 

A notorious problem from elementary number theory is the "Mersenne Prime 
Conjecture." This asserts that the Mersenne sequence M — (M n ) defined by 

M n = 2 n -l (n = l,2,...) 

contains infinitely many prime terms, which are known as Mersenne primes. 

The Mersenne prime conjecture is related to a classical problem in number 
theory concerning perfect numbers. A whole number is said to be perfect if, 
like 6=1 + 2 + 3 and 28 = f + 2 + 4 + 7 + f 4, it is equal to the sum of all its 
proper divisors. Euclid pointed out that 2 k ~ 1 (2 k — 1) is perfect whenever 2 fc — 1 
is prime. A much less obvious result, due to Euler, is a partial converse: if n 
is an even perfect number, then it must have the form 2 fc ~ 1 (2 fc — 1) for some k 
with the property that 2 k — 1 is a prime. Whether there are any odd perfect 
numbers remains an open question. Thus finding Mersenne primes amounts to 
finding (even) perfect numbers. 

The sequence M certainly produces some primes initially, for example, 

M 2 = 3, M 3 = 7, M 5 = 31, M 7 = 127, .... 

However, the appearance of Mersenne primes quickly thins out: only forty-three 
are known, the largest of which, M^q^q2, 457, has over nine million decimal digits. 
This was discovered by a team at Central Missouri State University as part of 
the GIMPS project [221, which harnesses idle time on thousands of computers 
all over the world to run a distributed version of the Lucas-Lehmer test. 

A paltry forty-three primes might seem rather a small return for such a huge 
effort. Anybody looking for gold or gems with the same level of success would 
surely abandon the search. It seems fair to ask why we should expect there to 
be infinitely many Mersenne primes. In the absence of a rigorous proof, our 
expectations may be informed by heuristic arguments. In section 3 we discuss 
heuristic arguments for this and other more or less tractable problems in number 
theory. 

Primitive prime divisors. In 1892, Zsigmondy [21] discovered a beautiful 
argument that shows that the sequence M does yield infinitely many prime 
numbers — but in a less restrictive sense. Given any integer sequence S — 
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(5'n)n>i, we define a primitive divisor of the term S n (=/= 0) to be a divisor 
of S n that is coprime to every nonzero term S m with m < n. Any prime factor 
of a primitive divisor is called a primitive prime divisor. Factoring the first 
few terms of the Mersenne sequence reveals several primitive divisors, shown in 
bold in Tabled Notice that the term Mq has no primitive divisor, but each 

Table 1: Primitive divisors of (M n ). 
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of the other early terms has at least one. Zsigmondy [23j proved that all the 
terms M n (n > 6) have primitive divisors. He also proved a similar result for 
more general sequences U — (U n ) n ^i, namely, those of the form U n = a n — b n , 
where a and b (a > b) are positive coprime integers: U n has a primitive divisor 
unless a = 2, b = 1 and n = 6ora + 6isa power of 2 and n = 2. 

Apart from the special situation in which a — b = 1, it is not reasonable to 
expect the terms U„ — a n — b n ever to be prime, since the identity 

a n - b n = {a- b){a n - 1 + a n ~ 2 b + a n - 3 b 2 + ■■■ + b^ 1 ) 

shows that U n is divisible by a — b. However, it does seem likely that for any co- 
prime starting values a and b infinitely many terms of the sequence (U n /(a — b)) 
might be prime. Sadly, no proof of this plausible statement is known for even a 
single pair of starting values. 

Although Zsigmondy's result is much weaker than the Mersenne prime con- 
jecture, it initiated a great deal of interest in the arithmetic of such sequences 
(see ^| chap. 6]). It has also been applied in finite group theory (see Praeger |T5] . 
for example). Schinzel jXH] extended Zsigmondy's result, giving further in- 
sight into the finer arithmetic of sequences like M. For example, he proved 
that M^k has a composite primitive divisor for all odd k greater than five. 

2 RECURRENCE SEQUENCES. For most people their first introduction 
to the Fibonacci sequence 

A 1 = 1,A 2 = 1, A 3 = 2, Ai = 3, A 5 = 5, A& = 8, . . . 

is through the (binary) linear recurrence relation 

A n +2 = A n+ i + A„. 
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Sequences such as the Mersenne sequence M and those considered by Zsigmondy 
are of particular interest because they also satisfy binary recurrence relations. 
The terms U n — a n — b n satisfy the recurrence 

U n+2 = (a + b)U n+ i - abll n (n = 1, 2, . . . ). 

More generally, let u and v denote conjugate quadratic integers (i.e., zeros 
of a monic irreducible quadratic polynomial with integer coefficients) . Consider 
the integer sequences U (it, v) and V(u, v) defined by 

U n (u,v) = {u n -v n )/{u-v), V n (u,v) =u n + v n . 

For instance, the Fibonacci sequence is given by 

A n = U n ^-J^C t —^L\. 

The sequence U(u, v) satisfies the recurrence relation 

U n+2 = (u + v)U n+1 - uvU n (n = 1, 2, . . .), 

and V(u, v) satisfies the same relation. 

Some powerful generalizations of Zsigmondy's theorem have been obtained 
for these sequences. Bilu, Hanrot, and Voutier [3| used methods from Diophan- 
tine analysis to prove that both U n (u,v) and V n {u,v) have primitive divisors 
once n > 30. The two striking aspects of this result are the uniform nature of 
the bound and its small numerical value. In particular, for any given sequence 
it is easy to check the first thirty terms for primitive divisors, arriving at a 
complete picture. For example, an easy calculation reveals that the Fibonacci 
number A n does not have a primitive divisor if and only ifn.= l,2,6, or 12. 

Bilinear recurrence sequences. The theory of linear recurrence sequences 
has a bilinear analogue. For example, the Somos-4 sequence S = {S n ) is given 
by the bilinear recurrence relation 

^n+4^n = Sn+sSn+l + ( n = 1; 2, . . . ), 

with the initial condition Si = S2 = S3 = S4 = 1. This sequence begins 

1, 1, 1, 1, 2, 3, 7, 23, 59, 314, 1 529, 8 209, 833 313, 620 297, 7 869 898, .... 

Amazingly, all the terms are integers even though calculating S' n +4 a priori in- 
volves dividing by S n . This sequence was discovered by Michael Somos |20| . 
and it is known to be associated with the arithmetic of elliptic curves (see [101 
sees. 10.1, 11.1] for a summary of this, and further references, including a re- 
markable observation due to Propp et al. that the terms of the sequence must 
be integers because they count matchings in a sequence of graphs.) 

Amongst the early terms of S are several primes: of those that we listed, 

2,3,7,23,59,8 209,620 297 
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are prime. It seems natural to ask whether there are infinitely many prime 
terms in the Somos-4 sequence. More generally, consider integer sequences S 
satisfying relations of the type 

S n +4S n = e«SVi+3>SVi+l + f$n+2' (•"■) 

where e and / are integral constants not both zero. Such sequences are of- 
ten called Somos sequences (or bilinear recurrence sequences) and Christine 
Swart |23 j building on earlier remarks of Nelson Stephens, showed how they 
are related to the arithmetic of elliptic curves. Some care is needed because, 
for example, a binary linear recurrence sequence always satisfies some bilinear 
recurrence relation of this kind. We refer to a Somos sequence as nonlinear if 
it does not satisfy any linear recurrence relation. These are natural general- 
izations of linear recurrence sequences, so perhaps we should expect them to 
contain infinitely many prime terms. Computational evidence in [J] tended to 
support that belief because of the relatively large primes discovered. However, 
a heuristic argument (discussed later) using the prime number theorem was 
adapted in 0, and it suggested that a nonlinear Somos sequence should contain 
only finitely many prime terms. See jH] for proofs in some special cases. 

On the other hand, Silverman established a qualitative analogue of Zsig- 
mondy's result for elliptic curves that applies, in particular, to the Somos-4 
sequence. An explicit form of this result proved by Everest, McLaren, and 
Ward [8: guarantees that from S 5 onwards all terms have primitive divisors. 
There are many nonlinear Somos sequences to which Silverman's proof does 
not apply. A version of Zsigmondy's theorem valid for these sequences awaits 
discovery. 

Polynomials. Given the previous sections, it might be tempting to think that 
all integral recurrence sequences have primitive divisors from some point on. 
However, it is easy to write down counterexamples. The sequence T = (T n ) 
defined by T n = n, which satisfies 

T n +2 = 2T„ + i — T n , 

is a binary linear recurrence sequence that does not always produce primitive 
divisors. This is a rather trivial counterexample, so consider now the sequence P 
defined by 

P n = n 2 + (3, 

where [3 is a nonzero integer. The terms of this sequence satisfy the linear 
recurrence relation 

Pn+3 — 3P n +2 — 3P n _|_i + P n . 

It has long been suspected that for any fixed (3 such that —j3 is not a square the 
sequence P contains infinitely many prime terms. A proof is known for not even 
one value of (3. It seems reasonable to ask the apparently simpler question about 
the existence of primitive divisors of terms. Clearly any prime term is itself a 
primitive divisor, but do the composite terms have primitive divisors? Using 
a result of Schinzel about the largest prime factor of the terms in polynomial 
sequences it is fairly easy to prove the following: 



4 



Theorem 2.1. If —/3 is not a square, then there are infinitely many terms of 
the sequence P that do not have primitive divisors. 

We prove Theorem l2.1l in section 4. Computations suggest that the following 
stronger result should be true. 

Conjecture 2.2. Suppose that —j3 is not a square. If Pfs(N) denotes the 
number of terms P n in the sequence P with n < N that have primitive divisors, 
then 

Pf} {N)~cN 
for some constant c satisfying < c < 1. 

Here, and throughout the article, for functions / : K — > R and g : K — > R+, 
we write / ~ g to mean f(x)/g(x) — > 1 as a; — > oo. 

In the last section of the article, we consider some approaches to bounding 
the number of terms in P that have primitive divisors. For example, we will 
furnish a simple proof that 

liminf^^I. 

We have been unable to find a proof of Conjecture 12.21 In section 3 we show 
how other kinds of arguments can be marshalled in its support, and in section 4 
we discuss briefly the nature of the constant c. 

Linear recurrence sequences. To set matters in a more general context, 
define L — (L„)„^i to be a linear recurrence sequence of order k (k ^ 1) if it 
satisfies a relation 

Ln+k = Ck-iLn+k-i H 1- c a L n (n = 1, 2, . . . ) (2) 

for constants Cq, . . . ,Cfc_i, but satisfies no shorter relation. When k — 3 (re- 
spectively, k = 4), the sequence L is called a ternary (respectively, quaternary) 
linear recurrence sequence. For example, the sequences P considered in the pre- 
vious section are all ternary linear recurrence sequences. Theorem 12.11 shows 
that Zsigmondy's theorem cannot extend to these quadratic sequences. Some 
nonpolynomial sequences that cannot satisfy Zsigmondy will now be presented. 

With u and v again denoting conjugate quadratic integers, the integer se- 
quence W(u,v) = (W n (u,v)) n ^i defined by 

W n (u,v) = (u n -l)(v n -l) 

is always a linear recurrence sequence. 

Example 1. The sequence B = -W{2 + ^/3, 2- V3) begins 

2, 12, 50, 192, 722, 2700, 10082, 37632, 140450, 524172, 

and it is a ternary sequence satisfying 

Bn+3 — 5-B„+2 — 5B 1l+ i + B n . 

From the seventh term on, all the terms of the sequence seem to have primitive 
divisors. 
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Example 2. The sequence C — — W(l + V%, 1 — V%) begins 

2, 4, 14, 32, 82, 196, 478, 1152, 2786, 6724, 

and it is a quaternary sequence satisfying 

C n +4 = 2C n +z + 2C n +2 — 2C„+i — C„. 

In contrast to the previous example, the terms Cik for odd k do not have 
primitive divisors. 

In general, when uv = —1, the terms W2k(u, v) for odd A; fail to yield prim- 
itive divisors. This is because an easy calculation reveals that 

W 2k (u,v) = -W k (u,v) 2 

when k is odd. On the other hand, we recommend the following as an exercise: 
when uv = 1, the terms W n (u, v) do produce primitive divisors from some point 
on. As far as we can tell, to establish this requires Schinzel's extension JS] 
of Zsigmondy's result to the algebraic setting. (We are indebted to Professor 
Gyory for communicating to us the remarks about W(u,v).) 

All of these special cases can be subsumed into a wider picture. Write 

f(x) = X k - Ck-lX^ 1 c 

for the characteristic polynomial of the linear recurrence relation in J2J) . Then / 
can be factored over C, 

f{x) = {x - ai ) ei . . . (x - a d ) ed . 

The algebraic numbers ai, ...,ay, are known as the characteristic roots (or 
just roots) of the sequence. The terms L n of any sequence L satisfying the 
relation can be written 

d 

i=l 

for polynomials g±, . . . , <7<j of degrees e± — 1, . . . , — 1 with algebraic coefficients. 

The roots of the sequences in Examples ^ and [21 are quite different in charac- 
ter. In general, if uv = 1, then W(u,v) is a ternary linear recurrence sequence 
with roots l,it, andi>. When = —1, W{u, v) is quaternary with roots 1,-1, u, 
and v. For the quadratic sequence defined by P n = n 2 + f3, a = 1 is a triple 
root of the associated characteristic polynomial. 

It seems reasonable to conjecture that the terms of an integral linear recur- 
rence sequence of order greater than one will have primitive divisors from some 
point on provided that its roots are distinct and no quotient cq / ctj of different 
roots is a root of unity. 
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3 HEURISTIC ARGUMENTS. There are a number of ways that mathe- 
maticians form a view on which statements are likely to be true. These views 
inform research directions and help to concentrate effort on the most fruitful 
areas of enquiry. 

The only certainty in mathematics comes from rigorous proofs that adhere 
to the rules of logic: the discourse of logos. When such a proof is not available, 
other kinds of arguments can make mathematicians expect that statements will 
be true, even though these arguments fall well short of proofs. These are called 
heuristic arguments — the word comes from the Greek root EvpqKa (Eureka), 
meaning "I have found it." It usually means the principles used to make de- 
cisions in the absence of complete information or the ability to examine all 
possibilities. In informal ways, mathematicians use heuristic arguments all the 
time when they discuss mathematics, and these are part of the mythos discourse 
in mathematics. 

One consequence of the prime number theorem is the following statement: 
the probability that N is prime is roughly 1/ log N. What this means is that if 
an element of the set {1, . . . , N} is chosen at random using a fair iV-sided die, 
then the probability pn that the number chosen is prime satisfies Pn log N — > 1 
as N — > oo. This crude estimate has been used several times to argue heuris- 
tically in favor of the plausibility of conjectured solutions of difficult problems. 
Some examples follow. In each case the argument presented falls well short of a 
proof, yet it still seems to have some predictive power and has suggested lines 
of attack. 

Fermat primes. Hardy and Wright [U sec. 2.5] argued along these lines that 
there ought to be only finitely many Fermat primes. A Fermat prime is a prime 
number in the sequence (F n ) of Fermat numbers: 



Fermat demonstrated that ^1,^2,^3, and F4 are all primes; Euler showed 
that F 5 is composite by using congruence arguments. Since then, many Fer- 
mat numbers have been shown to be composite and quite a few have been 
completely factored. Wilfrid Keller maintains a web site ^21 with details of the 
current state of knowledge on factorization of Fermat numbers. The number of 
Fermat primes F n with n < N, if they are no more or less likely to be prime 
than a random number of comparable size, should be roughly 



Statements like this cannot be taken too literally, for the numbers F n have 
many special properties, not all of which are understood. However, this kind of 
argument tends to support the belief that there are only finitely many Fermat 
primes and would incline many mathematicians to attempt to prove that state- 
ment rather than its negation. Massive advances in computing power suggest 
that we know — indeed, that Fermat knew — all the Fermat primes. 



F n = 2 2 " + 1. 
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Mersenne primes. The prime number theorem can also be used to argue in 
support of the Mersenne prime conjecture. A heuristic argument of the following 
form is used. First, 2 k — 1 can be prime only for k a prime, so assume now that k 
is a prime p. We would like to estimate the probability that 2 P — 1 is prime. The 
prime number theorem suggests that a random number of the size of 2 P — 1 is 
prime with probability 1 / log(2 p — 1) , which is around 1 jp log 2. However, 2 P — 1 
is far from random: it is not divisible by 2, nor by 3, and indeed not by any prime 
smaller than 2p. Arguing in this way suggests that the probability that 2 P — 1 
is prime is approximately 

1 2 3 5 _q_ 
Pp ~plog2' l"2*4 q-1' [ ) 

where q is the largest prime less than 2p. This suggests that the expected 
number of Mersenne primes M n with n < N is roughly J2 p <n Pp- 

Since p p > l/plog2, the sum diverges by Mertens's theorem (see Q for 
a precise statement), which suggests that there are infinitely many Mersenne 
primes. Wagstaff (22J and then Pomerance and Lenstra ^H] have extended this 
heuristic argument by including estimates for the product of rationals in J3J 
to obtain an asymptotic estimate that closely matches the available evidence. 
On the basis of these heuristics, they conjecture that the number of Mersenne 
primes M n with n < N is asymptotically 

log A^, 



log 2 

where 7 is the Euler-Mascheroni constant. Caldwell's Prime Page 4 gives more 
details about these arguments and about the hunt for new Mersenne primes. 

Bilinear recurrence sequences. Consider now the Somos sequences dchncd 
by the recurrence Q . General results about heights on elliptic curves show that 
the growth rate of S n is quadratic-exponential. In other words, 

\ogS n — hn 2 , 

where h is a positive constant. Thus the expected number of prime terms 
with n < N should be approximately 

\p 1 1 \^ 1 71-2 
^ logS„ ~ h ^ ^ 6^' 

n<N 6 " n<N 

This resembles the argument of Hardy and Wright and suggests that only finitely 
many prime terms should be expected. Proofs of the finitcncss in many spe- 
cial cases have subsequently been found [H]. The search for these proofs was 
motivated in part by the heuristic arguments. Interestingly, it is known that 
the constant h is uniformly bounded below across all nonlinear integral Somos 
sequences. Thus the style of this heuristic argument suggests that perhaps the 
total number of prime terms is uniformly bounded across all such sequences. 



8 



Extensive calculation has failed to yield a sequence with more than a dozen 
prime terms. 

Quadratic polynomials. Suppose (3 is an integer that is not the negative of 
a square and recall the sequence P given by P n — n 2 + [3. Again, the prime 
number theorem predicts that there are roughly 



prime terms in the sequence P with n < N ', assuming again that P n is neither 
more nor less likely to be prime than a random number of that size. The sum is 
asymptotically N/ (2logN), which supports the belief that there are infinitely 
many prime terms in the sequence P. Computation suggests that for fixed (3 
there will be dN/ logiV prime terms with n < N, where d = d{(3) is a constant 
that depends upon (3. Bateman and Horn [2] offered a heuristic argument and 
provided numerical evidence to suggest that 



where the product is taken over all primes and w(p) denotes the number of 
solutions x modulo p to the congruence x 2 = —(3 (mod p). 

4 BIASED NUMBERS. We now return to the problem of looking for prim- 
itive prime factors in the sequence given by P n = n 2 + (3 with —j3 not a square. 
Since we are mainly interested in asymptotic behaviour, we assume from now 
on that n > |/3|. The terms P n with n ^ \f3\ are not guaranteed to exhibit the 
behavior described in this section. 

Lemma 4.1. A prime p is a primitive divisor of P n if and only if p divides P n 
and p > In. 

Proof. Consider first a prime p dividing P n with p < n. Then, by assump- 
tion, P n = (mod p) , so P m = (mod p) for some m smaller than p simply by 
choosing m to be the residue of n modulo p. Because p < n, m < n. In others 
words, p is not a primitive divisor of P n . 

This means that to find primitive divisors of P n we have to look for prime 
divisors that are greater than n. (Note that n does not divide P n , as n > \j3\.) 
We can say more: we can guarantee a solution of P m = (mod p) for some m 
satisfying m ^ p/2. Thus, to find primitive divisors we have to look only 
amongst the prime divisors that are bigger than In (i.e. a prime p dividing P n 
is a primitive divisor only if p > 2n). 

Conversely, suppose that p is a prime dividing P n that is not a primitive 
divisor. Then n 2j r(3 = (mod p), and there is an integer m (< n) with m 2 +{3 = 
(mod p) , so (by subtracting the two congruences) m 2 — n 2 = (mod p) . It 
follows that m ± n = (mod p). In particular, 





p ^ m + n < 2n. 
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It follows that a prime p is a primitive divisor of P n if and only if p divides P n 
and p > 2n. □ 



We call an integer k is biased if it has a prime factor q with q > 2\fk. Thus 
any prime greater than three is biased. The numbers 22, 26, and 34 are biased, 
whereas 24 and 28 are not. 

Proposition 4.2. When n > |/3|, i/ie term P„ /lets a primitive divisor if and only 
if Pn is biased. If n > \(3\ and P n has a primitive divisor, then that primitive 
divisor is a prime, and it is unique. 

Proof. Part of the first statement comes from Lemma 14.11 To complete the 
proof of the first statement we claim that, for n greater than \/3\, P n has such a 
prime divisor if and only if P n is biased. If p is a prime dividing P n and p > 2n, 

then 

p ^ 2n + 1 > 2\jn 2 + n > 2\] n 2 + (3. 

Conversely, if p > 2y n 2 + (3, then 

p ^ 2\Jn 2 - n + 1 > 2n — 1, 

so p > 2n (since 2n cannot be prime) . 

The uniqueness of the primitive divisor follows at once. If p is a prime 
dividing P n and p > 2\fP^, then no other prime divisor can be as large, hence 
cannot be primitive. □ 

The requirement n > \/3\ is necessary: if |/3| is prime, then has primitive 
divisor \/3\ but is not biased. Also, terms with small n may have more than one 
primitive divisor. For example, the sequence of values of the polynomial n 2 + 6 
begins 7, 10, ... so the second term has two primitive prime divisors. The kind 
of results discussed here are asymptotic results, which makes this restriction 
unimportant. 

Proof of Theorem \2.1\ Results of Schinzel ^| Theorem 13] show that for any 
positive a, the largest prime factor of P n is bounded above by n a for infinitely 
many n. Taking a = 1, we conclude that P n is not biased infinitely often. By 
Proposition 14.21 P n fails to have a primitive divisor infinitely often. □ 

In section 5 of the article, we consider quantitative information about the 
frequency with which, rather than the extent to which, P n is not biased. 

Support for Coni ect ure 12.21 follows from Proposition 14. 21 because an asymp- 
totic formula can be obtained for the distribution of biased numbers. Along- 
side the earlier notation for describing the growth rates of various functions, 
we also use the following: Given functions / : R — > M and g : R — > R + , we 
write / = 0(g) to mean that \f(x)\/ g(x) is bounded and / = o(g) to signify 
that f(x)/g(x) — > as x — > oo. 

Theorem 4.3. If 7r,(AT) denotes the number of biased numbers less than or 
equal to N , then 

7T t (JV)~ ATlog2. 
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Proof. Write a biased number as qm, where q is its largest prime factor. The 
biased condition then translates to q > 4m. To compute the number of biased 
numbers below TV, note that the counting can be achieved by dividing the set 
into two parts. Let p denote a variable prime. When p < 2^J~N there are |p/4j 
biased integers pm smaller than A (here [x\ denotes the greatest integer less 
than or equal to x). When p ^ 2y/~N, each number pm smaller than A is biased, 
so there are [N /p\ biased integers pm. Hence the total number is 



p<2VlV 



+ £ 



The first sum is 0(N / log A) and can be ignored asymptotically. The second 
sum differs from 

n y i 

p 

by an amount that is 0(N/ log A) by the prime number theorem. To estimate 
this sum we use Mertens's formula, which can be found in Apostol's book 1 
Theorem 4.12]: 

5^- = logloga; + A + o(l). (4) 

p<x P 

Applying Q thus estimates ir l (N) as 



□ 



A log log A + A - log(log VN + log 2) - A + o(l 



which is asymptotically jVlog2. 



Theorem 14.31 can be applied to give the following heuristic argument in 
support of Conjecture 12.21 The probability that a large integer is biased is 
roughly log 2. Hence the expected number of biased values of n 2 +j3 with n < N 
is asymptotically AT log 2. Computational evidence suggests that the number of 
biased terms in n 2 + (3 is asymptotically cN for some constant c. Computations 
with |/3 1 < 10 suggest the constant c looks reasonably close to log 2 in each case, 
although convergence appears slow. 

5 COUNTING PRIMITIVE DIVISORS. The article concludes with some 
simple estimates for p | g(AT), the number of terms P n in the sequence P with n < 
N that have primitive divisors. The proofs use little aside from well-known 
estimates for sums over primes, which can be found in the book of Apostol 1 . 

Theorem 5.1. There is a constant C > such that 

CN 



P ^ <N - log A 
holds for all sufficiently large A. There is a constant D > such that 

N DN 

y ~ be a 

is true for all sufficiently large A. 



< Pp(N) 



(5) 



(6) 
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Both of the statements in Theorem 15.11 can be strengthened along the fol- 
lowing lines: any choice of constants C or D could be made. As each constant 
varies, so does the smallest value of N beyond which the inequalities become 
valid. 

Apart from a finite number of primes, any prime p that divides n 2 + j3 has 
the property that — (3 is a quadratic residue modulo p. Let 1Z denote the set of 
odd primes for which —/3 is a quadratic residue. Notice that 1Z comprises the 
intersection of a finite union of arithmetic progressions with the set of primes and 
that this finite union of arithmetic progressions in turn comprises exactly half of 
the residue classes modulo 4|/3|. We will prove the two parts of Theorem l5.1l in 
reverse order, because the upper bound © arises by specializing the argument 
used to prove the lower bound ©. 

Write 

N 

Qn = H \Pn\, 

71=1 

and denote by uj(Qn) the number of distinct prime divisors of Qn- By Proposi- 
tion it is sufficient to bound u>(Qn) because, with finitely many exceptions, 
a primitive divisor is unique. 

Proof of the lower bound. Define 

S = {peTZ: p\Q N and p < 2N}, S' = {p e TZ : p\Q N and p ^ 27V}. 
Let s = \S\ and s' = \S'\. We seek a lower bound for s + s' , since 

s + s' = lo{Q n ). 

The asymptotic form of Dirichlet's theorem 1 on primes in arithmetic progression 
implies that asymptotically half the primes lie in TZ, so 



N 



log 2N 

Therefore it is sufficient to estimate s' from below. 
Proof of equation From the definition of Qn, 



(7) 



N N I N \ 

logQA r = ^log> 2 + /?H2^(logn + 0(^)) = 2^1ogn 

n— 1 n— 1 \ 7i— 1 / 



0(1), 



1 In 1826 Dirichlet proved that if a and b are positive integers with no common factor, then 
there are infinitely many primes of the form ax + b with x in N. This result, which appeared 
in a memoir published in 1837 was proved using methods from analysis, thus laying the 
foundations for the subject now called analytic number theory. Writing -K a {X) for the number 
of primes of the form ax + b with x < X, Dirichlet proved that ■jTa(X) — > oo as X — > oo. There 
is also what might be called a prime number theorem for arithmetic progressions, which gives 
an asymptotic estimate for the number of such primes. It states that -rr a (X) ~ X/<j>(a) logX, 
where <j> is the Euler totient function. This was shown by de la Vallee Poussin; a proof can be 
found in the book of Prachar 1141 chap. 5, sec. 7]. It is this result that we are using here. 
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so by Stirling's formula 

\ogQN — 2NlogN — 2N + 0(1). (8) 
On the other hand, we can write 

Y e P l °SP = 1o sQn, (9) 
p\Qn 

for positive integers e p corresponding to the prime decomposition np|Q N P Cp 
of Qm- The first step in the proof is to identify a subset of TZ that contributes 
a fixed amount to the main term in |J8}. The sum on the left-hand side of 
can be decomposed to give 

e p \ogp+ Y e p logp+ ^2 \ogp = \ogQ N , (10) 

p&s,p<n pes,p^N pes' 

noting that e p = 1 whenever p ^ 2N. The second term in the decomposition 
is O(N), since e p ^ 2 for p in S with p > N, each term in the sum is no larger 
than log 2N, and the prime number theorem implies that there are 0(N/ log N) 
terms. Thus the second term does not contribute to the asymptotic behaviour. 
Assume for the moment that 

e P logp = N log N + 0(N). (11) 

p£S,p<N 

Combining ©, JXDJ, and ||TTJ gives 

N log N + 0(N) = Y lo SP < s'\ogP N = s'log{N 2 + (3). 
pes' 

Thus ||BJ follows at once, subject to the proof of (|11|) . □ 
Proof of equation l|ll|) . For each p in S 

p ^ p 

Hence there is a constant c > such that the left-hand side of l|ll|l is bounded 
below by 



pe5,p<w 

By Apostol P Theorem 7.3]. 



logp. 



Y [^Jlogp = iVlogiV + 0(iV). (12) 



p£S,p<N 



For p in S and fc in N, denote by ord p (fc) the exponent of the greatest power 
of p dividing k and put 

B P (N) ={n<N: ord p (P„) > 1}. 
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Then 



e p logp= [f L J 1 °SP+ E ( E ordp(P n ) - 1 j logp. 

peS,p<N peS : p<N P eS,p<N \ neB p (N) I 

We now show that the second term is asymptotically negligible. For p in S the 
number —f3 has two p-adic square roots, and ord p (P„) = r + 1 if and only if 
the p-adic expansion of n agrees with one of these square roots up to the term 
in p r and no further. Hence 



logp 



£ £ ord p (P„)-l logp < £ E- 2 [^ 

p£S,p<N \ neB p (N) ) pES,p<N \ r=l 

P es, P <N yF ' 

which is O(N) because the sum converges and s = 0(N/ log N) by Q. Putting 
this together with (|T2*)l establishes l(TT|) . □ 

Proof of the upper bound. This proof is similar to that for the lower bound. 
However, it relies on a finer partition of the set 1Z. Given integers K > 2 
and N > K, we split S' into the sets 

T={peK: p\Q Nl 2N < p < KN}, U = {pEK: p\Q N ,KN < p). 

Proof of equation J5J. Write t = \T\ and it = \U\. As before, the contribution 
from s is negligible. Thus we wish to bound the expression t + u from above. 
The sum on the left-hand side of © decomposes according to the definitions 
of S, T, and U to give 

e P lo SP + E logp + E logp = 1o &Qn, 

pes per P eu 

noting as earlier that e p = 1 whenever p > 2N. Equations (JSJ, (|10() . and 
reveal that 

logp + V" logp < N log N + aN 
per P eu 

for some positive a. The left-hand side is greater than 

t log N + u log(KN) , 
so we add t log K to both sides to obtain 

(t + u) log(KN) < N log N + aN + 1 log if. 
Rearranging the right-hand side gives 

(t + u) \og(KN) < N \og(KN) + (a- log K)N + t log K. 
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Assume that K is fixed, with C = log if — a > 0. Dividing through by log(AA) 
leads to 

« CN tlogK 

^ t+ ^ <N -^(m + ^Ny (13) 

The inequality — 1/(1 + x) < — 1 + x holds when x > 0. We apply this with x = 
log K/ log AT to the second term on the right of Ijl3(l to obtain the inequality 

CN CN „ ( N 
< 7—7 + 



log( AA) log( A) V (!og N) 

whose last term is asymptotically negligible. The last term on the right of l|13fl 
can be estimated by appealing to Dirichlet's theorem again, yielding 

tlogK =0 (t\ =0 ( N 



log(AA) \logN J VOog^O 2 
which is also asymptotically negligible. Hence for any C (0 < C < C), 

C'N 

uj{Q N )~t + u<N-- - 

log A 

for all large A. □ 

A slightly stronger result is provable with these methods, namely, that 

wvw N log log A 
P^< N ~ logN 

for all sufficiently large A. We leave this as an exercise to the interested reader. 
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